Flexible theft and resolute punishment: Evolutionary dynamics of social behavior among reinforcement-learning agents
Abstract
Existing models of the evolution of social behavior typically involve innate strategies such as tit-for-tat. Yet, both behavioral and neural evidence indicates a substantial role for learned social behavior. We explore the evolutionary dynamics of two simple social behaviors among learning agents: Theft and punishment. In our simulation, agents employ Q-learning, a common reinforcement learning algorithm. Agents reproduce in proportion to the objective rewards they accrue, but the subjective reward function that guides learning and action evolves by natural selection. We find that agents typically evolve a bias to punish thieves that is sufficiently strong that it cannot be unlearned. Agents also typically evolve a bias to abstain from theft, but this is weak enough to permit rapid learning. This flexibility allows would-be thieves to exploit non-punishers. Finally, we show qualitatively similar results in a behavioral experiment on human participants: Flexible theft, but resolute punishment.
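The separation between the subjective reward that drives learning and the objective reward that drives reproduction can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' simulation: the single-state setup, the payoff values (a gain of 1.0 for theft, a cost of 3.0 when punished), the names `theft_bias`, `punish_prob`, `run_lifetime`, and `reproduce`, and the Gaussian mutation step are all hypothetical.

```python
import random

def q_update(q, action, reward, next_max, alpha=0.1, gamma=0.9):
    """Standard tabular Q-learning update for one action value."""
    q[action] += alpha * (reward + gamma * next_max - q[action])

def run_lifetime(theft_bias, punish_prob, steps=500, epsilon=0.1, rng=None):
    """One lifetime of a would-be thief in a single-state game.

    theft_bias is a hypothetical evolved term added to the subjective
    reward for stealing (negative values act as an aversion to theft).
    Learning sees the subjective reward; fitness sums the objective one.
    """
    rng = rng or random.Random(0)
    q = {"steal": 0.0, "abstain": 0.0}
    fitness = 0.0
    for _ in range(steps):
        # Epsilon-greedy action selection.
        if rng.random() < epsilon:
            action = rng.choice(list(q))
        else:
            action = max(q, key=q.get)
        if action == "steal":
            # Assumed payoffs: gain 1.0, lose 3.0 if the victim punishes.
            punished = rng.random() < punish_prob
            objective = 1.0 - (3.0 if punished else 0.0)
            subjective = objective + theft_bias
        else:
            objective = 0.0
            subjective = 0.0
        q_update(q, action, subjective, max(q.values()))
        fitness += objective
    return q, fitness

def reproduce(biases, fitnesses, rng, mutation_sd=0.1):
    """Fitness-proportional selection of biases with Gaussian mutation."""
    low = min(fitnesses)
    # Shift so every weight is positive before sampling parents.
    weights = [f - low + 1e-9 for f in fitnesses]
    parents = rng.choices(biases, weights=weights, k=len(biases))
    return [b + rng.gauss(0.0, mutation_sd) for b in parents]
```

In this sketch, a weak negative `theft_bias` can be overridden by experience when `punish_prob` is low, which is one way to picture the "flexible theft" described in the abstract. A strong punishment bias that learning cannot overcome would correspond to a subjective term large enough to dominate the objective payoffs.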
Similar resources
Evolution of flexibility and rigidity in retaliatory punishment.
Natural selection designs some social behaviors to depend on flexible learning processes, whereas others are relatively rigid or reflexive. What determines the balance between these two approaches? We offer a detailed case study in the context of a two-player game with antisocial behavior and retaliatory punishment. We show that each player in this game (a "thief" and a "victim") must balance two...
The Effect of Electronical Media on the Reinforcement of Social Behavior of Youth from the Computer Course Professors and Students Viewpoints of Sari Islamic Azad University
The goal of this research was to examine the effect of electronic learning media on the reinforcement of youth social behavior, from the point of view of computer course professors and students at the Islamic Azad University of Sari. The statistical population included all computer students and professors of I.A.U. of Sari. The statistical sample was identified using the sample content identification ...
Numerical analysis of a reinforcement learning model with the dynamic aspiration level in the iterated Prisoner's dilemma.
Humans and other animals can adapt their social behavior in response to environmental cues including the feedback obtained through experience. Nevertheless, the effects of the experience-based learning of players in evolution and maintenance of cooperation in social dilemma games remain relatively unclear. Some previous literature showed that mutual cooperation of learning players is difficult ...
Hierarchical Functional Concepts for Knowledge Transfer among Reinforcement Learning Agents
This article introduces the notions of functional space and concept as a way of representing and abstracting knowledge for Reinforcement Learning agents. These definitions are used as a tool for knowledge transfer among agents. The agents are assumed to be heterogeneous; they have different state spaces but share the same dynamics, reward, and action space. In other words, the agents are assumed t...
An Online Q-learning Based Multi-Agent LFC for a Multi-Area Multi-Source Power System Including Distributed Energy Resources
This paper presents an online two-stage Q-learning based multi-agent (MA) controller for load frequency control (LFC) in an interconnected multi-area multi-source power system integrated with distributed energy resources (DERs). The proposed control strategy consists of two stages. The first stage employs a PID controller whose parameters are designed using sine cosine optimization (SCO...
Publication date: 2014